skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Wang, Yiren"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Free, publicly-accessible full text available March 1, 2026
  2. The global COVID-19 pandemic has highlighted the need for rapid, reliable, and efficient detection of biological agents and the necessity of tracking changes in genetic material as new SARS-CoV-2 variants emerge. Here we demonstrate that RNA-based, single-molecule conductance experiments can be used to identify specific variants of SARS-CoV-2. To this end, we i) select target sequences of interest for specific variants, ii) utilize single-molecule break junction measurements to obtain conductance histograms for each sequence and its potential mutations, and iii) employ the XGBoost machine learning classifier to rapidly identify the presence of target molecules in solution with a limited number of conductance traces. This approach allows high-specificity and high-sensitivity detection of RNA target sequences less than 20 base pairs in length by utilizing a complementary DNA probe capable of binding to the specific target. We use this approach to directly detect SARS-CoV-2 variants of concerns B.1.1.7 (Alpha), B.1.351 (Beta), B.1.617.2 (Delta), and B.1.1.529 (Omicron) and further demonstrate that the specific sequence conductance is sensitive to nucleotide mismatches, thus broadening the identification capabilities of the system. Thus, our experimental methodology detects specific SARS-CoV-2 variants, as well as recognizes the emergence of new variants as they arise. 
    more » « less
  3. Abstract Deoxyribonucleic acid (DNA) has emerged as a promising building block for next-generation ultra-high density storage devices. Although DNA has high durability and extremely high density in nature, its potential as the basis of storage devices is currently hindered by limitations such as expensive and complex fabrication processes and time-consuming read–write operations. In this article, we propose the use of a DNA crossbar array architecture for an electrically readable read-only memory (DNA-ROM). While information can be ‘written’ error-free to a DNA-ROM array using appropriate sequence encodings its read accuracy can be affected by several factors such as array size, interconnect resistance, and Fermi energy deviations from HOMO levels of DNA strands employed in the crossbar. We study the impact of array size and interconnect resistance on the bit error rate of a DNA-ROM array through extensive Monte Carlo simulations. We have also analyzed the performance of our proposed DNA crossbar array for an image storage application, as a function of array size and interconnect resistance. While we expect that future advances in bioengineering and materials science will address some of the fabrication challenges associated with DNA crossbar arrays, we believe that the comprehensive body of results we present in this paper establishes the technical viability of DNA crossbar arrays as low power, high-density storage devices. Finally, our analysis of array performance vis-à-vis interconnect resistance should provide valuable insights into aspects of the fabrication process such as proper choice of interconnects necessary for ensuring high read accuracies. 
    more » « less
  4. Polymerase chain reaction (PCR) has long been the mainstay in genetic sequencing and identification. Irrespective of whether short read or long read technologies are adopted, PCR methods are generally time consuming and expensive. Recently, an all-electronic approach, the so-called Single Molecule Break Junction (SMBJ) method, has been proposed as a possible alternative to PCR. In this article, we evaluate the performance of four different classifier models on the current signatures of ten short strand sequences, including a pair that differs by a single mismatch. We find that a gradient boosted tree classifier model achieves impressive accuracies, ranging from approximately 96% for molecules differing by a single mismatch to 99.5% otherwise. 
    more » « less
  5. Deoxyribonucleic acid (DNA) has emerged as a promising building block for designing next-generation ultra-high density storage devices. Although DNA is highly durable and extremely high density in nature, its potential as the basis of storage devices is currently hindered by limitations such as expensive and complex fabrication processes and time-consuming read-write operations. In this article, we propose the use of a DNA crossbar array architecture for an electrically-readable Read-Only Memory (DNA-ROM). For DNA-ROM, we have chosen two DNA strands for representing Bit 1 and Bit 0 respectively. DNA charge transport has been studied through a contact-DNA-contact setup. The results obtained from the DNA charge transport study have been used to analyze the crossbar array. The performance has been analyzed by loading an image onto a 128×128 crossbar. For this application, we have observed a bit error rate of 4.52% and power consumption of 6.75 µW. 
    more » « less
  6. null (Ed.)
  7. Abstract BackgroundThe all-electronic Single Molecule Break Junction (SMBJ) method is an emerging alternative to traditional polymerase chain reaction (PCR) techniques for genetic sequencing and identification. Existing work indicates that the current spectra recorded from SMBJ experimentations contain unique signatures to identify known sequences from a dataset. However, the spectra are typically extremely noisy due to the stochastic and complex interactions between the substrate, sample, environment, and the measuring system, necessitating hundreds or thousands of experimentations to obtain reliable and accurate results. ResultsThis article presents a DNA sequence identification system based on the current spectra of ten short strand sequences, including a pair that differs by a single mismatch. By employing a gradient boosted tree classifier model trained on conductance histograms, we demonstrate that extremely high accuracy, ranging from approximately 96 % for molecules differing by a single mismatch to 99.5 % otherwise, is possible. Further, such accuracy metrics are achievable in near real-time with just twenty or thirty SMBJ measurements instead of hundreds or thousands. We also demonstrate that a tandem classifier architecture, where the first stage is a multiclass classifier and the second stage is a binary classifier, can be employed to boost the single mismatched pair’s identification accuracy to 99.5 %. ConclusionsA monolithic classifier, or more generally, a multistage classifier with model specific parameters that depend on experimental current spectra can be used to successfully identify DNA strands. 
    more » « less